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Abstract — Motivated by the need to monitor large-scale net- 
works, this paper addresses the problem of sparse recovery 
with graph constraints. Unlike conventional sparse recovery 
where a measurement can contain any subset of the unknown 
variables(nodes), we take an additive measurement over nodes 
only if they induce a connected subgraph. We provide explicit 
measurement constructions for several special graphs, and the 
number of measurements by our construction is less than that 
needed by existing random constructions. Moreover, our con- 
struction for a line network is provably optimal in the sense that it 
requires the minimum number of measurements. A measurement 
construction algorithm for general graphs is also proposed and 
evaluated. For any given graph G with n nodes, we derive bounds 
of the minimum number of measurements needed to recover any 
fc-sparse vector over G (Afjf „). Using the Erdos-Renyi random 
graph as an example, we characterize the dependence of M% n 
on the graph structure. Our study suggests that M^f n may serve 
as a graph connectivity metric. 

Index Terms — sparse recovery, compressed sensing, topological 
graph constraints, measurement construction. 



I. Introduction 

Network monitoring is a critical module in the operation and 
management of communication networks, where one keeps 
track of network state parameters, such as bandwidth utiliza- 
tion and queueing delay. Since measuring each component 
(e.g., router) in the network directly can be operationally 
costly, if feasible at all, the topic of inferring system internal 
characteristics from indirect end-to-end (aggregate) measure- 
ments becomes important. This area is known as network 
tomography, and has been extensively studied during the last 
decade or so (9j, HZ), (T5], (23), (27), EQ, 1591 . 

In many cases, the total number of aggregate measurements 
is much smaller than the number of components in a network. 
But we still hope to extract the status of each individual 
component with some prior knowledge of the unknown signal 
to recover. For instance, if the signal is sparse, i.e. most entries 
are zero, we can recover it exactly even though the number 
of measurements is much smaller than its dimension. For 
example, transmission delays in the communication networks 
can be represented by an approximately sparse signal, since 
only a small number of bottleneck links experience large de- 
lays, while the delay is approximately zero elsewhere. Sparse 
Recovery addresses the problem of recovering sparse signals 
from a smaller number of measurements, and has two different 
but closely related problem formulations. One is Compressed 
Sensing 0,|H3,[ID,ED,|2Q|,|21, where the signal is rep- 
resented by a high-dimensional real vector, and an aggregate 
measurement is the arithmetical sum of the corresponding real 
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entries. The other is Group Testing ll2D . 11221 . where the high- 
dimensional signal is binary and a measurement is a logical 
disjunction (OR) on the corresponding binary values. 

One key question in both compressed sensing and group 
testing is to design a small number of non-adaptive measure- 
ments (either real or logical) such that all the vectors (either 
real or logical) up to certain sparsity (the support size of 
a vector) can be correctly recovered. Most existing results, 
however, rely critically on the assumption that any subset 
of the values can be aggregated together ifTOl . (19) , which 
is not realistic in network monitoring problems where only 
objects that form a path or a cycle on the graph (T), (27l . or 
induce a connected subgraph can be aggregated together in the 
same measurement. Only a few recent works consider graph 
topological constraints, either in group testing ifTSl setup, 
especially motivated by link failure localization in all-optimal 
networks [3|, 1131 . [28], 1132) . [36), or in compressed sensing 
setup, with application in estimation of network parameters 
ED, (25), (29), (38). 

Though motivated by network monitoring problems, sparse 
recovery with graph constraints abstractly models scenarios 
when certain elements cannot be measured together in a 
complex system. These constraints can result from various 
reasons, not necessarily lack of connectivity. Therefore, our 
results can be potentially useful to other applications besides 
network tomography. 

Here are the main contributions of this paper. 

(1) We provide explicit measurement constructions for various 
graphs. Our construction for line networks is optimal in the 
sense that it requires the minimum number of measurements. 
For other special graphs, the number measurements by our 
construction is less than the existing estimates (e.g. 0~3), [38 1) 
of the number of measurements required to recover sparse 
vectors over graphs. (Section ITTTb 

(2) We propose a measurement design guideline based on 
r-partition for general graphs and further show some of its 
properties. (Section [IV- At 

(3) A simple measurement design algorithm is proposed for 
general graphs, and we evaluate its performance both theoret- 
ically and numerically. (Section [IV-BI and IVIIB 

(4) For Erdos-Renyi random graphs, we characterize the de- 
pendence of the number of measurements for sparse recovery 
on the graph structure. (Section W\ 

(5) Motivated by practical needs, we further propose measure- 
ment construction methods under additional graph constraints 
including measurement length constraint, and the requirement 
that each measurement should pass one of a fixed set of nodes. 
(SectionlVTli We also address the issue of sparse recovery when 
some critical measurements may contain errors. (Section [Vill i 



II. Model and Problem Formulation 

Consider a graph G = (V, E), where V denotes the set 
of nodes with cardinality |V| = n and E denotes the set of 
links. Each node i is associated with a real number Xi, and 
we say vector x = (xi,i = 1, n) is associated with G. Let 
T = {i | Xi 7^ 0} denote the support of x, and let ||x||o = l^l 
denote the number of non-zero entries of x, we say x is a 
fc-sparse vector if ||x||o = k. 

Let S C V denote a subset of nodes in G. Let Es denote 
the subset of links with both ends in S, then Gs — (S, Es) 
is the induced subgraph of G. We have the following two 
assumptions throughout the paper: 

(Al): A set S of nodes can be measured together in one 
measurement if and only if Gs is connected. 
(A2): The measurement is an additive sum of values at the 
corresponding nodes. 

(Al) captures the graph constraints. One practical example 
is a sensor network where the nodes represent sensors and the 
links represent feasible communication between sensors. For a 
set S of nodes that induce a connected subgraph, one node u 
in S, we call u an "agent", monitors the sum of node values in 
S. Every node in S obtains values from its children, if any, on 
the spanning tree rooted at u, sums them up with its own value 
and sends the sum to its parent. Then the central operator can 
obtain the sum of node values in S by only communicating 
with the agent u. (A2) follows from the additive property of 
many network characteristics, e.g. delays and packet loss rates 
|27|. Note that compressed sensing can also be applied to 
cases where (A2) does not hold, e.g., the measurements can 
be nonlinear as in (6), [33). 

Let y £ lZ m (to <C n) denote the vector of to mea- 
surements. A is an to x n measurement matrix with its zth 
row corresponding to the ith measurement, i.e., Aij = 1 
(i = 1, to, j = 1, n) if and only if node j is included in 
the ith measurement and = otherwise. We can write in 
the compact form that y = Ax. We say a measurement matrix 
A can identify all k-sparse vectors if and only if Ax\ ^ Ax2 
for every two different vectors xi and X2 that are at most k- 
sparse. This definition indicates that every vector x that is at 
most fc-sparse can be recovered from Ax via ^-minimization, 
which returns the sparsest vector among all vectors that can 
produce the same observation Ax. Sparse recovery theory 
indicates that it is possible to identify n-dimensional vectors 
from to (to -c ri) measurements provided that the vectors are 
sparse enough. 



TABLE I 
SUMMARY OF KEY NOTATIONS 




Fig. 1. Network Example 

With the above assumptions, A is a 0-1 matrix, and for each 
row of A, the set of nodes that correspond to ' 1 ' should form 
a connected induced subgraph of G. For example in Fig.Q] we 



Notation 


Meaning 


G s 


Subgraph of G induced by S 


Mi? 

k,n 


Minimum number of measurements needed to identify fc- 
sparse vectors associated with G of n nodes. 




Minimum number of measurements needed to identify fc- 
sparse vectors associated with a complete graph of n nodes. 


f{k,n) 


Number of measurements constructed to identify fc-sparse 
vectors associated with a complete graph of n nodes 



can measure the sum of nodes in Si and 52 by two separate 
measurements, and the measurement matrix is 



.4 



1110 110 
1 1 1 1 



We remark here that in group testing with graph constraints, 
the requirements for the measurement matrix A are the same, 
while group testing differs from compressed sensing only in 
that (1) x is a logical vector, and (2) the operations used in 
each group testing measurement are the logical "AND" and 
"OR". Here we consider compressed sensing if not otherwise 
specified, and the main results are stated in theorems. We 
sometimes discuss group testing for comparison, and the 
results are stated in propositions. Note that for recovering 
1-sparse vectors, the numbers of measurements required by 
compressed sensing and group testing are the same. 

Given a graph G with n nodes, let M^ n denote the 
minimum number of non-adaptive measurements needed to 
identify all fc-sparse vectors associated with G. Let Mu n 
denote the minimum number of non-adaptive measurements 
needed in a complete graph with n nodes. Since in a complete 
graph, any subset of nodes forms a connected subgraph, every 
0-1 matrix is a feasible measurement matrix there. Existing 
results 0, JIT], [37 1 show that with overwhelming probability 
a random 0-1 matrix with 0(fclog(n/fc)) rows3 can identify 
all fc-sparse vectors associated with a complete graph, and 
we can recover the sparse vector by l\ -minimization, which 
returns the vector with the least ^i-nor IT0 among those that 
can produce the obtained measurements. Then we have 



0(fclog(n/fc)). 



(1) 



Explicit constructions of measurement matrices for com- 
plete graphs also exist, e.g., (2), 0, HTJ, US), (37). We 
use f(k,n) to denote the number of measurements to re- 
cover fc-sparse vectors associated with a complete graph of 
n nodes by a particular measurement construction method. 
f(k, n) varies for different construction methods, and clearly 
f(k,n) > MP . Table UJ summarizes the key notations. 

The questions we would like to address in the paper are: 

• Given graph G, what is the corresponding M^ n l What 
is the dependence of MP on G? 

'We use the notations g(n) G 0(h(n)), g(n) G Q(h(n)), or g(n) = 
@(h(n)) if as n goes to infinity, g(n) < ch(n), g(n) > ch(n) or c\h(n) < 
g(n) < C2h(n) eventually holds for some positive constants c, Ci and C2 
respectively. 

2 The £p-norm (p > 1) of x is ||x|| p = \xi\ p ) 1 ^ p , IMIoo = 

max, \xi\, and ||x||rj = \{i : Xi ^ 0}|. 




Fig. 2. (a) line network (b) ring network 



• How can we explicitly design measurements such that the 
total number of measurements is close to n ? 

III. Sparse Recovery over Special Graphs 

In this section, we consider four kinds of special graphs: 
one-dimensional line/ring network, ring with each node con- 
necting to its four closest neighbors, two-dimensional grid and 
a tree. The measurement construction method for a line/ring 
network is different from those for the other graphs, and our 
construction is optimal (or near optimal) for a line (or ring) 
network. For other special graphs, we construct measurements 
based on the "hub" idea and will later extend it to general 
graphs in Section IPVl 



A. Line and Ring 

First consider a line/ring network as shown in Fig. [2] When 
later comparing the results here with those in Section II1I-BI 
one can see that the number of measurements required for 
sparse recovery can be significantly different in two graphs 
that only differ from each other with a small number of links. 

In a line/ring network, there is not much freedom in the 
measurement design since only consecutive nodes can be 
measured together from (Al). Recovering 1-sparse vectors 
associated with a line (or ring) network with n nodes is 
considered in J28), J22, which show that |"2±i] ( or [|]) 
measurements are both necessary and sufficient in this case. 
Here, we consider recovering fc-sparse vectors for fc > 2. 

Our construction works as follows. Given k and n, let t = 
Lf+rJ- ^ e construct n + 1 — Lf+yJ measurements with the 
ith measurement passing all the nodes from i to i + t — 1. Let 
^(n+i-tjxn k e tne corresponding measurement matrix, then 
its ith row has 'l's from entry i to entry i + t — 1 and 'O's 
elsewhere. For example, when k = 3 and n = 11, we have 
t = 3, and 

"11100000000 
01110000000 
00111000000 
00011100000 
I 00001110000 . (2) 
00000111000 
00000011100 
00000001110 
^00000000111 

Let M^ n and Mj} n denote the minimum number of mea- 
surements required to recover fc-sparse vectors in a line/ring 
network respectively. We have 



Theorem 1. Our constructed n + 1 — Lf+yJ measurements 
can identify all k-sparse vectors associated with a line/ring 
network with n nodes. Moreover, 



1 



M£ n = n + 1 - L-^yJ < M£„ + 1 



(3) 



Proof: We first prove that the constructed n + 1 — Lf+fJ 
measurements can identify all k-sparse vectors associated with 
a line/ring network with n nodes. 

The recovery is successful if and only if every 2k columns 
of A are linearly independent, i.e., every non-zero vector z 
such that Az = has at least 2fc + 1 non-zero elements fTOll . 

A is an identity matrix when t = 1, and the argument holds 
trivially. We focus on the case that t > 2. Now tk < n — 1 
holds from definition. For each index 1 < k' < k — 1, define 
a submatrix Ay , which consists of the first tk' + 1 rows and 
the first tk' + t columns of A. For example, for A in ©, 



1110 
1110 
1110 
1 1 1 



An 



1 1 1 
1 1 1 
1 1 1 



1 



110 




1 1 1 
1 1 1 







1 1 1 

We claim that: (*) every non-zero vector w such that 
AyW = has at least 2k' + 2 non-zero elements, and at 
least two non-zero elements exist in its last t entries. We will 
prove (*) by induction over fc'. 

First consider A\. Let af denote its ith row. Because any 
two columns of A\ are linearly independent, any w/0 such 
that A\w = must have at least three non-zero elements. Let 
j be the index of the last non-zero element of w. Suppose j < 
t, then ajw = wj ^ 0, contradicting the fact that Ai~w = 0. 
Then j > t+1 must hold, i.e., at least one of the last t entries 
of w is non-zero. From a^w = 0, at least two non-zero 
elements exist in the last t entries of w. One can similarly 
argue that at least two non-zero elements exist in the first t 
entries of w. Thus, (*) holds for A\. 

Now suppose (*) holds for Ay with integer fc' in [1, fc — 2]. 
Consider w ^ such that Ak'+i~w = 0. Let w denote the 
subvector of the first tk' + t entries of w, then Ayw = 0. 
We remark that Suppose w = 0, and let j denote the 

index of the first non-zero entry of w. Then the inner prodcuct 
of the (j + 1 — i)th row of A^'+i with w equals to Wj, which 
is non-zero, contracting the fact that Ak'+i\v = 0. 

Since w/O, from the induction hypothesis, it has at least 
2fc' + 2 non-zero elements, and at least two non-zero elements 
in its last t entries. Now consider the last 2t entries of w and 
the last t+1 rows of Ay+i. From a similar argument as for 
Ai, we know that w must have at least two non-zero elements 
in the last t entries. So w has at least 2(k' + 1) + 2 non-zero 
elements, and (*) holds for fc' + 1. 



By induction over fc', we conclude that (*) follows. Consider 
any z^O such that Az = 0. Let w contain its first kt entries. 
One can argue that w ^ 0. Then by (*), w has at least 2k 
non-zero entries. Let j denote the index of the last non-zero 
entry of w. Since the inner product between z and the jth row 
of A is zero, there exists at least one non-zero entry in the last 
n — kt entries of z. Thus, z has at least 2k +1 non-zero entries 
in total. This concludes the proof. 

We next prove that the number of measurements needed 
to recover k-sparse vectors associated with a line (or ring) 
networks is at least n + 1 — Lf+jJ ( or n ~ L'FFiJ--' 

Let A mxn denote a measurement matrix with which one 
can recover fc-sparse vectors associated with a line network 
with n nodes. Then every 2k columns of A must be linearly 
independent. We will prove that m > n + 1 — Lf+yJ. 

Let (3 l denote the ith column of A. Define a 1 = (3 1 , a 1 = 
pi_pi-i for aii 2 < i < n , and a n+1 = -f3 n . Define matrix 
pmx(n+i) _ ^ 1 < j < yj, -)- i). Since A is a measurement 
matrix for a line network, each row of P contains one '1' entry 
and one ' — 1' entry, and all the other entries must be 'O's. 

Given P, we construct a graph G eq with n + 1 nodes as 
follows. For every row i of P, there is an edge (j, fc) in G eq , 
where Pij = 1 and Pa- = —1. Then G eq contains m edges, 
and P can be viewed as the transpose of an oriented incidence 
matrix of G eq . Let S denote the set of indices of nodes in a 
component of G eq , then one can check that 



f3 l = 0. 



(4) 



Since every 2k columns of A are linearly independent, every 
k columns of P are linearly independent, which then implies 
that the sum of any k columns of P is not a zero vector. With 
©, we know that any component of G eq should have at least 
fc+1 nodes. Since a component with r nodes contains at least 
r — 1 edges, and G eq has at most Lf+yj components, then 
G eq contains at least n + 1 — Ltj+jJ edges. The claim follows. 

We next consider the ring network. Let A denote the 
measurement matrix with which one can recover fc-sparse 
vectors on a ring network with n nodes. Let (3 l denote the 
ith column of A. Define 5 1 = /3 1 - /3", and a 1 = ft - ft' 1 
for all 2 < i < n. Define matrix P mxn = {a\ 1 < % < n). 
Similarly, we construct a graph G eq with n nodes based on P, 
and each component of G eq should have at least fc + 1 nodes. 
Thus, G eq contains at most La^ptJ components, and has at 
least n — Lfe^rJ edges. Then we have 

i n i ^ i n+ 1 



M? n > n 



V 



jfe + 1- 

and the inequality of (01 holds. ■ 
Theorem Q] indicates that our construction is optimal for a 
line network in the sense that the number of measurements 
is equal to the minimum needed to recover fc-sparse vectors. 
It is near optimal for a ring network, since the number of 
measurements is no more than the minimum plus one. This 
improves over our earlier result (Theorem 1 in (34]), which 
does not have optimality guarantee. 

We can save about Lf+^J — 1 measurements but still be 
able to recover fc-sparse vectors in a line/ring network via 



compressed sensing. But for group testing on a line/ring 
network, n measurements are necessary to recover more than 
one non-zero element. The key is that every node should be 
the endpoint at least twice, where the endpoints are the nodes 
at the beginning and the end of a measurement. If node u is an 
endpoint for at most once, then it is always measured together 
with one of its neighbors, say v, if ever measured. Then when 
v is '1', we cannot determine the value of u, either '1' or 
'0'. Therefore, to recover more than one non-zero element, 
we need at least 2n endpoints, and thus n measurements. 

B. Ring with nodes connecting to four closest neighbors 

Consider a graph with each node directly connecting to its 
four closest neighbors as in Fig. H] (a), denoted by Q A . Q A is 
important to the study of small-world networks ll35l . Q A has 
n more links than the ring network, but we will show that 
the number of measurements required by compressed sensing 
to recover fc-sparse vectors associated with Q 4 significantly 
reduces from 0(n) to 0(k log(n/fc)). The main idea is 
referred to as " the use of a hub". 




Fig. 3. Hub S for T 



Definition 1. Given G = (V, E), S C V is a hub for T CV 

if Gs is connected, and Vw £ T, 3s £ S s.t. (u, s) £ E. 

We first take one measurement of the sum of nodes in S, 
denoted by s. For any subset W of T, e.g., the pink nodes 
in Fig. [3] S U W induces a connected subgraph from the hub 
definition and thus can be measured by one measurement. To 
measure the sum of nodes in W, we first measure nodes in 
S U W and then subtract s from the sum. Therefore we can 
apply the measurement constructions for complete graphs on 
T with this simple modification, and that requires only one 
additional measurement for the hub S. Thus, 

Theorem 2. With hub S, M^m +1 measurements are enough 
to recover k-sparse vectors associated with T. 

The significance of Theorem|2]is that Gt is not necessarily 
a complete subgraph, i.e., a clique, and it can even be 
disconnected. As long as there exists a hub S, the measurement 
construction for a complete graph with the same number of 
nodes can be applied to T with simple modification. Our later 
results rely heavily on Theorem |2] 

In Q A , if nodes are numbered consecutively around the ring, 
then the set of all the odd nodes, denoted by T Q , form a hub 
for the set of all the even nodes, denoted by T e . Given a fc- 
sparse vector x, let x and x e denote the subvectors of x 
with odd and even indices. Then x Q and x e are both at most 
fc-sparse. From Theorem |2j M^, n , 2 \ + 1 measurements are 




(a) Measure nodes 2,8 and 10 via hub T (b) Delete h long links 

Fig. 4. Sparse recovery on graph Q 4 

enough to recover x e € 7?.L™/ 2 J . Similarly, we can use T e as a 
hub to recover the subvector x D G 72. T™/ 2 ! with \n/2~\ + 1 
measurements, and thus x is recovered. 

Corollary 1. All k-sparse vectors associated with Q 4 can be 
recovered with MP, + Mf r , nl + 2 measurements, which 

re, I n/ 2 j k,\n/2\ ' 

is 0(2fclog(n/(2fc))). 

From ring network to Q 4 , although the number of links 
only increases by n, the number of measurements required to 
recover fc-sparse vectors significantly reduces from 0(n) to 
0(2k log(n/(2fc))). This value is in the same order as Afp , 
while the number of links in Q 4 is only 2n, compared with 
n(n — l)/2 links in a complete graph. 

Moreover, our estimate 0(2k log(n/(2fc))) on the minimum 
number of measurements required to recover fc-sparse vectors 
greatly improves over the existing results in ||T3l . [38 1, both 
of which are based on the mixing time of a random walk. The 
mixing time T(n) can be roughly interpreted as the minimum 
length of a random walk on graph G such that its distribution 
is close to the stationary distribution on G. Il38l proves that 
0(kT 2 (n) logn) measurements can identify fc-sparse vectors 
with overwhelming probability by compressed sensing. ifPTl 
needs 0(k 2 T 2 (n) log(n/fc)) measurements to identify fc non- 
zero elements by group testing. In Q 4 , T{n) should be at least 
n/A. Then both results provide no saving in the number of 
measurements for Q 4 as the mixing time is 8(ri). 

Besides the explicit measurement construction based on the 
hub idea, we can also recover fc-sparse vectors associated with 
Q 4 with O(logrt) random measurements. We need to point 
out that these random measurements do not depend on the 
measurement constructions for a complete graph. 

Consider an ?i-step Markov chain {Xk,l < fc < n} with 
X\ = 1. For any k < n — 1, if Xk = 0, then Xk+i = 1; 
if Xk = 1, then Xk+i can be or 1 with equal probability. 
Clearly any realization of this Markov chain does not contain 
two or more consecutive zeros, and thus is a feasible row of 
the measurement matrix. Moreover, 

Theorem 3. With high probability all k-sparse vectors asso- 
ciated with Q can be recovered with 0(g(k) logn) measure- 
ments obtained from the above Markov chain, where g{k) is 
a function of fc. 

Proof: See Appendix. ■ 
Adding n links in the form + 2 (mod n)) to the ring 
network greatly reduces the number of measurements needed 



from 8(n) to O(logn). Then how many links in the form 
+ 2 (mod n)) shall we add to the ring network such that 
the minimum number of measurements required to recover fc- 
sparse vectors is exactly 9 (log n)l The answer is n— 6 (log n). 
To see this, let denote the graph obtained by deleting h 
links in the form (i, i + 2(mod n)) from Q 4 . For example in 
Fig. IH (b), we delete links (3,5), (8,10) and (9,11) in red 
dashed lines from Q 4 . Given h, our following results do not 
depend on the specific choice of links to remove. We have 

Theorem 4. The minimum number of measurements required 
to recover k-sparse vectors associated with is lower 
bounded by \h/2~\, and upper bounded by 2M^„^ + h + 2. 

Proof: Let D denote the set of nodes such that for every 
i e D, link (i — 1, i + is removed from Q 4 . The proof of the 
lower bound follows the proof of Theorem 2 in |32l . The key 
idea is that recovering one non-zero element in D is equivalent 
to recovering one non-zero element in a ring network with h 
nodes, and thus \h/2] measurements are necessary. 

For the upper bound, we first measure nodes in D separately 
with h measurements. Let S contain the even nodes in D 
and all the odd nodes. S can be used as a hub to recover 
the fc-sparse subvectors associated with the even nodes that 
are not in D, and the number of measurements used is at 
most „, + 1. We similarly recover fc-sparse subvectors 
associated with odd nodes that are not in D using the set of 
the odd nodes in D and all the even nodes as a hub. The 
number of measurements is at most M^r„-. + 1. Sum them 
up and the upper bound follows. ■ 

Together with (Q~|i, Theorem [4] directly implies that if 
0(logn) links in the form + 2(mod n)) are deleted 
from Q 4 , then 0(logn) measurements are both necessary 
and sufficient to recover associated fc-sparse vectors for any 
constant fc. 

Since the number of measurements required by compressed 
sensing is greatly reduced when we add n links to the ring net- 
work, one may wonder whether the number of measurements 
needed by group testing can be greatly reduced or not. Our 
next result shows that this is not the case for group testing, 
please refer to Appendix for its proof. 

Proposition 1. L n /4J measurements are necessary to locate 
two non-zero elements associated with Q 4 by group testing. 

By Corollary Q] and Proposition Q] we observe that in Q 4 , 
with compressed sensing the number of measurements needed 
to recover fc-sparse vectors is (9(2fc log(n/(2fc))), while with 
group testing, 0(n) measurements are required if fc > 2. 

C. Two-dimensional grid 

Next we consider the two-dimensional grid, denoted by Q 2d . 
Q 2d has y/ri rows and ^Jn columns. We assume ^/n to be even 
here, and also skip '[•]' and '[_•_]' for notational simplicity. 

The idea of measurement construction is still the use of a 
hub. First, Let 5i contain the nodes in the first row and all 
the nodes in the odd columns, i.e., the black nodes in Fig. 
Then S\ can be used as a hub to measure fc-sparse subvectors 
associated with nodes in V\S\. The number of measurements 




Fig. 5. Two-dimensional grid Fig. 6. Tree topology 

is Mr, ,„ y— + 1. Then let So contain the nodes in the 

k,(n/2 — yn/2) ' 

first row and all the nodes in the even columns, and use S2 
as a hub to recover up to fc-sparse subvectors associated with 
nodes in V\S2- Then number of measurements required is also 
(n/2- v /n/2) Fi na lly^ use nodes in the second row as a 
hub to recover sparse subvectors associated with nodes in the 
first row. Since nodes in the second row are already identified 
in the above two steps, then we do not need to measure the 
hub separately in this step. The number of measurements here 
is MP Therefore, 

k, v n 

With 2MF /„ + MF 1— + 2 measurements one can 

k,n/2—s/n/2 k,^/n 

ecover k-sparse vectors associated with Q 2d . 
D. Tree 

Next we consider a tree topology as in Fig. [6] For a given 
tree, the root is treated as the only node in layer 0. The nodes 
that are t steps away from the root are in layer t. We say the 
tree has depth h if the farthest node is h steps away from the 
root. Let rij denote the number of nodes on layer i, and n = 1. 
We construct measurements to recover vectors associated with 
a tree by the following tree approach. 

We recover the nodes layer by layer starting from the root, 
and recovering nodes in layer i requires that all the nodes 
above layer i should already be recovered. First measure the 
root separately. When recovering the subvector associated with 
nodes in layer i (2 < i < h), we can measure the sum of any 
subset of nodes in layer i using some nodes in the upper layers 
as a hub and then delete the value of the hub from the obtained 
sum. One simple way to find a hub is to trace back from nodes 
to be measured on the tree simultaneously until they reach one 
same node. For example in Fig. [6] to measure the sum of nodes 
5 and 7, we trace back to the root and measure the sum of 
nodes 1,2, 3, 5, and 7 and then subtract the values of nodes 1, 
2, and 3, which are already identified when we recover nodes 
in the upper layers. With this approach, we have, 

2~2i=o MjF n measurements are enough to recover k-sparse 
vectors associated with a tree with depth h, where rii is the 
number of nodes in layer i. 

IV. Sparse Recovery over General Graphs 

In this section we consider recovering fc-sparse vectors 
associated with general graphs. The graph is assumed to be 
connected. If not, we design measurements to recover fc-sparse 
subvectors associated with each component separately. 

In Section IIV-AI we propose a general design guideline 
based on "r-partition". The key idea is to divide the nodes 



into a small number of groups such that each group can be 
measured with the help of a hub. Since finding the minimum 
number of such groups turns out to be NP-hard in general, 
in Section IIV-BI we propose a simple algorithm to design 
measurements on any given graph. 

A. Measurement Construction Based on r-partition 

Definition 2 (r-partition). Given G = (V, E), disjoint subsets 
Ni (i = 1, ...,r) of V form an r-partition of G if and only if 
these two conditions both hold: (1) UjLjiVi = V, and (2) Vi, 
V\Ni is a hub for N. 

Clearly, T a and T e form a 2-partition of graph Q 4 . With 
Definition |2] and Theorem we have 

Theorem 5. If G has an r-partition N (i — 1, ...,r), then the 
number of measurements needed to recover k-sparse vectors 
associated with G is at most 2~2l=i ^k\N\ + r > which is 
0(rk log(n/fc)). 

Another example of the existence of an r-partition is the 
Erdos-Renyi random graph G(n,p) with p > logn/n. The 
number of our constructed measurements on G(n,p) is less 
than the existing estimates in fHl . ||38| . Please refer to Section 
IV1 for the detailed discussion. 

Clearly, if an r-partition exists, the number of measurements 
also depends on r. In general one wants to reduce r so as 
to reduce the number of measurements. Given graph G and 
integer r, the question that whether or not G has an r-partition 
is called r-partition problem. In fact, 

Theorem 6. Vr > 3, r-partition problem is NP-complete. 

Please refer to Appendix for its proof. We remark that we 
cannot prove the hardness of the 2-partition problem though 
we conjecture it is also a hard problem. 

B. Measurement Construction Algorithm for General Graphs 

Section IIV-AI proposes the r-partition concept as a mea- 
surement design guideline. But finding an r-partition with 
the smallest r in general is NP-hard. Given a connected 
graph G, how shall we efficiently design a small number of 
measurements to recover fc-sparse vectors associated with G? 

One simple way is to find the spanning tree of G, and 
then use the tree approach in Section IIII-DI The depth of the 
spanning tree is at least R, where R — min ue y max„ 6 v d uv 
is the radius of G with d uv as the length of the shortest path 
between u and v. This approach only uses links in the spanning 
tree, and the number of measurements needed is large when 
the radius R is large. For example, the radius of Q 4 is ri/4, 
then the tree approach uses at least n/4 measurements, while 
0(2fc log(n/2fc)) measurements are already enough if we take 
advantage of the additional links not in the spanning tree. 

Here we propose a simple algorithm to design the measure- 
ments for general graphs. The algorithm combines the ideas 
of the tree approach and the r-partition. We still divide nodes 
into a small number of groups such that each group can be 
identified via some hub. Here nodes in the same group are the 
leaf nodes of a spanning tree of a gradually reduced graph. A 
leaf node has no children on the tree. 



Subroutine 1 Leaves(G, u) 
Input: graph G, root u 

1 Find a spanning tree T of G rooted at u by breadth-first 
search, and let S denote the set of leaf nodes of T. 

2 Return: 5" 



Subroutine 2 Reduce(G, u, K) 

Input: G = (V,E), H e for each e 6 E, and node u 

1 V = V\u. 

2 for each two different neighbors v and w of u do 

3 if (v,w) E then 

4 E = EU(v, w), K(y tW ) = K( ViU ) U K( UyW ) U {u}. 

5 end if 

6 end for 

7 Return: G, K 



Let G* = (V*,E*) denote the orginal graph. The algorithm 
is built on the following two subroutines. Leaves(G, u) returns 
the set of leaf nodes of a spanning tree of G rooted at u. 
Reduce(G, u, K) deletes u from G and fully connects all the 
neighbors of u. Specifically, for every two neighbors v and 
w of u, we add a link (v, w), if not already exist, and let 
K(v,iu) — K(v,v.) u K(v,w) u { u }> where for each link (s,t), 
K ( s jt ) denotes the set of nodes, if any, that connects s and t in 
the original graph G*. We record K such that measurements 
constructed on a reduced graph G can be feasible in G*. 

Given graph G*, let u denote the node such that 
max„ e y* d uv = R, where R is the radius of G*. Pick u 
as the root and obtain a spanning tree T of G* by breadth- 
first search. Let S denote the set of leaf nodes in T. With 
V*\S as a hub, we can design f(k, \S\) + 1 measurements 
to recover up to £;-sparse vectors associated with S. We then 
reduce the network by deleting every node v in S and fully 
connects its neighbors. For the reduced network G, we repeat 
the above process until all the nodes are deleted. Note that 
when designing the measurements in a reduced graph G, if 
a measurement passes link (v,w), then it should also include 
nodes in K( v>w ) so as to be feasible in the original graph G*. 

In each step tree T is rooted at node u where max„ e y d uv 



Algorithm 1 Measurement construction for graph G* 
Input: G* = (V*,E*). 

1 G = G*, K e = for each e G E 

2 while |V| > 1 do 

3 Find the node u such that max„ e y d uv = R G , where 
R G is the radius of G. S =Leaves(G, u). 

4 Design f(k, \S\) + 1 measurements to recover fc-sparse 
vectors associated with S using nodes in V\S as a hub. 

5 for each v in S do 

6 G = Reduce(G, v, K) 
i end for 

8 end while 

9 Measure the last node in V directly. 
10 Output: All the measurements. 



equals the radius of the current graph G. Since all the leaf 
nodes of T are deleted in the graph reduction procedure, the 
radius of the new obtained graph should be reduced by at least 
one. Then we have at most R iterations in Algorithm Q] until 
only one node is left. Clearly we have, 

Theorem 7. The number of measurements designed by Algo- 
rithm Q] is at most Rf(k, n) + R + 1, where R is the radius 
of the graph. 

We remark that the number of measurements by the span- 
ning tree approach is also no greater than Rf(k, n) + R + 1. 
However, since Algorithm 1 also considers links that are not 
in the spanning tree, we expect that for general graphs, it uses 
fewer measurements than the spanning tree approach. This is 
verified in Experiment 1 in Section IVIIII 

V. Sparse Recover over Random Graphs 

Here we consider measurement constructions over the 
Erdos-Renyi random graph G(n,p), which has n nodes and 
every two nodes are connected by a link independently with 
probability p. The behavior of G(n,p) changes significantly 
when p varies. We study the dependence of number of mea- 
surements needed for sparse recovery on p. 

A. np = j3 log n for some constant j3 > 1 

Now G(n,p) is connected almost surely [24 1 . Moreover, 
we have the following lemma regarding the existence of an 
r-partition. 

Lemma 1. When p = j3 log n/nfor some constant j3 > 1, with 
probability at least l — 0(n~ a ) for some a > 0, every set S of 
nodes with size \S\ = n/(/3 — e) for any e£ (0, /3 — 1) forms a 
hub for the complementary set T = V\S, which implies that 
G(n,p) has a \ 1 1 -partition. 

Proof: Note that the subgraph Gs is also Erdos-Renyi 
random graph in G(n/(f3 — e),p). Since p = (3\ogn/n > 
log(n/(/3 — e))/(n/(f3 — e)), Gs is connected almost surely. 

Let Pf denote the probability that there exists some u E T 
such that (u, v) ^ E for every v G S. Then 

Pf = £(1 -P) ISI = (1 - J^) n 0- - /31ogn/n)"/^- £ ) 

uET P 6 

= (1 - l/(/3 - e))n(l - p\ogn/n)f^'^- 

<(l-J—)ne-^ < (i _ _!_) n -«/W-e). 
p — e p — e 

Thus, S is a hub for T with probability at least 1 — 0(nT a ) 
for a = e/(/3 - e) > 0. Since the size of T is (1 - l/(/3 - 
e))n, G(n,p) has at most [" ^J-i l sucrl disjoint sets. Then 
by a simple union bound, one can conclude that G(n,p) has 
a [ a^_~l 1 1 -partition with probability at least 1 — 0(n~ a ). ■ 
For example, when f3 > 2, Lemma Q] implies that any two 
disjoint sets Ni and N 2 with |7Yx| = \N 2 \ = n/2 form a 
2-partition of G(n,p) with probability 1 — 0{nT a ). From 
Theorem [5] and Lemma Q] and let e — > 0, we have 

When p — j3 log n/n for some constant /3 > 1, all k- 
sparse vectors associated with G(n,p) can be identified with 



0(\-^j~\k\og(n/k)) measurements with probability at least 
1 — 0(n~ a ) for some a > 0. 

Ifl3l considers group testing over Erdos-Renyi random 
graphs and shows that 0{k 2 log 3 n) measurements are enough 
to identify up to k non-zero entries if it further holds that 
p = 8(fclog 2 n/n). Here with compressed sensing setup and 
r-partition results, we can recover fc-sparse vectors in lZ n with 
0(k\og(n/k)) measurements when p > logn/n. This result 
also improves over the previous result in ll38l . which requires 
0(k log 3 n) measurements for compressed sensing on G(n, p). 

B. np - logn +00, and np ^ n -> 

Roughly speaking, p is just large enough to guarantee 
that G(n,p) is connected almost surely [24|. The diameter 
D = max u l , d uv of a connected graph is the greatest distance 
between any pair of nodes, and here it is concentrated around 
logtogn amlost surely Q. We design measurements on G(n,p) 
with Algorithm 1. With Theorem [7] and the fact that the radius 
R is no greater than the diameter D by definition, we have 

When np — logn — > +00, and " — > 0, 

0(k log n\og(n/k)/ log logn) measurements can identify k- 
sparse vectors associated with G(n,p) almost surely. 

C. 1 < c = np < log n 

Now G(n,p) is disconnected and has a unique giant com- 
ponent containing (a + o(l))n nodes almost surely with a 
satisfying e~ ca — 1 — a, or equivalently, 

k—1 

and all the other nodes belong to small components. The 
expectation of the total number of components in G(n,p) 
is (1 - a - c(l - a) 2 / 2 + o(l))n [24|. Since it is neces- 
sary to take at least one measurement for each component, 
(1 — a — c(l — a) 2 /2 + o(l))n is an expected lower bound of 
measurements required to identify sparse vectors. 

The diameter D of a disconnected graph is defined to be 
the largest distance between any pair of nodes that belong 
to the same component. Since D is now 6(log n/ log(np)) 
almost surely [14|, then for the radius R of the giant com- 
ponent, R < D = 0(log n/ log(np)), where the second 
equality holds almost surely. We use Algorithm 1 to design 
measurements on the giant component, and then measure every 
node in the small components directly. Thus, fc-sparse vectors 
associated with G(n,p) can be identified almost surely with 
0(k log n log(n/fc)/ log(np)) + (l — a+o(l))n measurements. 

Note that here almost surely the size of every small com- 
ponent is at most l °pX-i^£) ( Lemma 5, ED). If fc = 
fi(logn), almost surely (1 — a + o(l))n measurements are 
necessary to identify subvectors associated with small com- 
ponents, and thus necessary for identifying fc-sparse vectors 
associated with G(n,p). Combing the arguments, we have 

When 1 < c = np < log n with constant c, we can identify 
k-sparse vectors associated with G(n,p) almost surely with 
0(k log n log(n/fc)/ log(np)) + (l — a+o(l))n measurements. 



(l — a—c(l — a) 2 /2+o(l))nisan expected lower bound of the 
number of measurements needed. Moreover, if k = fi(logn), 
almost surely (1 — a + o(l))n measurements are necessary to 
identify k-sparse vectors. 

D. np < 1 

Since the expectation of the total number of components in 
G{n,p) with np < 1 is n - pn 2 /2 + 0(1) (24), then n - 
pn 2 /2 + 0(1) is an expected lower bound of the number of 
measurements required. Since almost surely all components 
are of size O(logn), then we need to take n measurements 
when fc — ri(logn). Therefore, 

When np < 1, we need at least n — pn 2 /2 + 0(1) 
measurements to identify k-sparse vectors associated with 
G(n,p) in expectation. Moreover, when k = f2(logn), n 
measurements are necessary almost surely. 

VI. Adding additional graph constraints 

Our constructions are based on assumptions (Al) and (A2). 
Here we consider additional graph constraints brought by 
practical implementation. We first consider measurement con- 
struction with length constraint, as measurements with short 
length are preferred in practice. We then discuss the scenario 
when only a subset of nodes can act as agents and each 
measurement should pass at least one agent. 

A. Measurements with short length 

We have not imposed any constraint on the number of nodes 
in one measurement. In practice, one may want to take short 
measurements so as to reduce the communication cost and the 
measurement noise. We next consider sparse recovery with 
additional constraint on measurement length, and we discuss 
two special graphs. 

1) Line and Ring: The construction in Section IIII-AI is 
optimal for a line network in terms of the number of measure- 
ments needed, and the length of each measurement is Lf+rJ' 
which is proportional to n when fc is a constant. Here we 
provide a different construction such that the total number of 
measurements needed to recover associated fc-sparse vectors 
is fcT-^pj-l + 1, but each measurement measures at most fc + 2 
nodes. We also remark that the number of measurements by 
this construction is within the minimum plus max(fc— 1, 1) for 
a line network, and the minimum plus fc for a ring network. 

We construct the measurements as follows. Given fc, let B k 
beafc + lbyfc + 1 square matrix with entries of ' 1 ' on the 
main diagonal and the first row, i.e. = 1 and = 1 
for all i. If fc is even, let B^-n = 1 for all 2 < i < k + 1; 
if fc is odd, let B k r u = 1 for all 2 < i < fc. B% = 
elsewhere. Let t — [-^Tyl, we construct a (kt + l) by (k + l)t 
matrix A based on B k . Given set S C {1, kt + 1} and set 
T C {1, (fc + l)i}, Ast is the submatrix of A with row 
indices in S and column indices in T. For all i — l,...,t, let 
S, = {(i - l)fc + l,...,ik + 1}, and let T t = {(fc + - 
1) + 1, (fc + Define A SzTt = B k for all i. All the 
other entries of A are zeros. We keep the first n columns of 
A as a measurement matrix for the line/ring network with n 



nodes. Note that the last one or serval rows of the reduced 
matrix can be all zeros, and we just delete such rows, and let 
the resulting matrix be the measurement matrix. For example, 
when k — 2 and n = 9, we have t = 3, and 



B 



1 1 1 
1 1 
1 1 



and 







A 



1 1 1 

1 1 
11111000 

1 1 

1 1 1 1 1 

1 1 

1 1 

When k — 3, and n = 8, we have t = 2 and 



(5) 



B 3 = 



1111 
110 
110 
1 
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Each measurement measures at most k + 2 nodes when k is 
even and at most k + 1 nodes when k is odd. We have, 

Theorem 8. The above construction can recover k-sparse 
vectors associated with a line/ring network with at most 
^l~FPrl measurements, which is within the minimum num- 
ber of measurements needed plus k. And each measurement 
measures at most k + 2 nodes. 

Proof: We only need to prove that all fc-sparse vectors in 

n (k+i)t 

can be identified with A, which happens if and only 
if for every vector z ^ such that Az = 0, z has at least 
2k + 1 non-zero elements. 

If i = 1, Aak + lbyk + 1 full rank matrix, and the claim 
holds trivially. We next consider t > 2. We prove the case 
when k is even, and skip the similar proof for odd k. 

For each integer t' in [2,t], define a submatrix A t > formed 
by the first kt' + 1 rows and the first (k + l)t' columns of A. 
For example, for A in we define 



-4, 



1110 
1 1 
11111 
1 1 
1 1 



and Aa 



A. 



We will prove by induction on t' that (#) every non-zero 
vector z € 7J.(' £ + 1 ) t such that A t /z = holds has at least 
2k + 1 non-zero elements for every t' in [2,t]. 

First consider A 2 , which is a (2k + 1) x (2k + 2) matrix. 
From the last k rows of A 2 , one can easily argue that for every 
z such that A 2 z = 0, its last k + 1 entries are either all zeros 
or all non-zeros. If the last k + 1 entries of z are all zeros, let 
z' denote the subvector containing the first k + 1 entries of z. 



Then we have = A 2 z = B z' . Since B is full rank, then 
z' = 0, which implies that z = 0. 

Now consider the case that last k + 1 entries of z are all 
non-zeros. Since k + 1 is odd, the sum of these entries is, 



2k+2 

22 Zi = z k+2 ^ o. 

i=k+2 



(6) 



Let af (i = 1, 2k + 1) denote the ith row of A 2 . We have 



2k+2 



iz = Zi = o. 



Combining © and we know that 

Zk + Zk+l = -Zk+2 ^ 0. 



(7) 



(8) 



Thus, at least one of z k and Zk+i is non-zero. Combining ([8]) 
with &J z = 0, we have one of the first k — 1 entries of z is 
non-zero. From &J z = for 2 < i < k — 1, one can argue 
that if one of the first k — 1 entries of z is non-zero, then all 
the first k — 1 entries are non-zero. Therefore, z has at least 
2k + 1 nonzero entries. (#) holds for A 2 . 

Now suppose (#) holds for some t' in [2,t — 1]. Consider 
matrix Af+i. Same as the arguments for A 2 , one can show 
that for every z ^ such that Af+\z — 0, its last k + 1 
entries are either all zeros or all non-zero. In the former case, 
let z' denote the subvector containing the first (k + l)t' entries 
of z. By induction hypothesis, z' has at least 2k + 1 nonzero 
entries, thus so does z. 

If the last k + 1 entries of z are all non-zero, like in the 
A 2 case, we argue that the sum of z^ k+1 - )t ,_ 1 and z^ k+ i) t , is 
non-zero, which implies that at least one of them is non-zero. 
Also consider a^z = with i = rk + 1 for every integer r in 
[0, tf — l], one can argue that there exist j in [0, t' — 1] such that 
the sum of all k — 1 entries from ^(fc+ij+i to Zj^+i)+k-i is 
non-zero. Then, from a^z = for i = jk + 2, jk + k — 1, 



we know that if the sum of z 



i(fe+i)+i 



to z 



j(k+l)+k-l 



zero, every entry is non-zero. We conclude that in this case z 
also has at least 2k + 1 nonzero entries. 

By induction over t', every z^O such that Az = has at 
least 2k + 1 non-zero entries, then the result follows. ■ 

This construction measures at most k + 2 nodes in each 
measurement. If measurements with constant length are pre- 
ferred, we provide another construction method such that every 
measurement only measures at most three nodes. This method 
requires more measurements, (2k — 1) \t^~\ + 1 measurements 
to recover fc-sparse vectors associated with a line/ring network. 

Given fc, let D k be a 2fc by 2fc square matrix having 
entries of '1' on the main diagonal and the subdiagonal and 
'0' elsewhere, i.e. = 1 for all i and D^_^ — 1 for 
all i > 2, and D& = elsewhere. Let t = |~^~|, we 
construct a (2kt — t + 1) by 2kt matrix A based on D k . 
Let Si = {(i - l)(2fc - 1) + l,...,i(2fc - 1) + 1}, and let 
Ti = {2k(i - 1) + 1, 2fci}. Define A Sz t z = D k for all 
i = l,...,t, and A^ = elsewhere. We keep the first n 
columns of A as the measurement matrix. For example, when 



k = 2 and n = 8, we have 



D 2 = 



10 
110 
110 
11 



and 



.4 



1 

1 1 

1 1 

1 1 1 

1 1 

1 1 

1 1 



(9) 



Theorem 9. The above constructed (2k — 1) \^] + 1 measure- 
ments can identify k-sparse vectors associated with a line/ring 
network of n nodes, and each measurement measures at most 
three nodes. 

Proof: When t = 1, A is a full rank square matrix. We 
focus on the case that t > 2. For each integer t' in [2, t], define 
a submatrix A t > formed by the first 2ki! — if + 1 rows and the 
first 2kt' columns of A. We will prove by induction on t' that 
every z ^ such that A t >z = holds has at least 2k + 1 
non-zero elements for every t' in [2,t]. 

First consider A%. For A in (O, A-i = A. From the first 
2k — 1 rows of A2, one can check that for every z such that 
A2Z = 0, its first 2k — 1 entries are zeros. From the 2fcth row 
of A2, we know that z-ik and z^k+i are either both zeros or 
both non-zero. In the former case, the remaining 2k— 1 entries 
of z must be zeros, thus, z = 0. In the latter case, one can 
check that the remaining 2k — 1 entries are all non-zero, and 
therefore z has 2k + 1 non-zero entries. 

Now suppose the claim holds for some if in [2,t — 1]. 
Consider vector z^O such that A t > + iz = 0. If Z2fe*'+l = 0, 
it is easy to see that the last 2k entries of z are all zeros. Then 
by induction hypothesis, at least 2fc + l entries of the first 2kt' 
elements of z are non-zero. If z-zkt'+i 7^ 0, one can check that 
the last 2k— 1 entries of z are all non-zero, and at least one of 
Z2H'-i and z 2kt' is non-zero. Thus, z also has at least 2k + 1 
non-zero entries in this case. 

By induction over t', every z^O such that Az = has at 
least 2k + 1 non-zero entries, then the theorem follows. ■ 

The number of measurements by this construction is greater 
than those of the previous methods. But the advantage of this 
construction is that the number of nodes in each measurement 
is at most three, no matter how large n and k is. 

2) Ring with each node connecting to four neighbors: We 
next consider Q 4 in Fig.|4](a). We further impose the constraint 
that the number of nodes in each measurement cannot exceed 
d for some predetermined integer d. We neglect |_ - J and [•] 
for notational simplicity. 

All the even nodes are divided into n/d groups, each of 
which contains d/2 consecutive even nodes. We use each 
group as a hub to measure d/2 odd nodes that have direct links 
with nodes in the hub. Then we can identify the values related 
to all the odd nodes with nM^ d ^ 2 /d + n/d measurements, 
and the number of nodes in each measurement does not 



exceed d. We then measure the even nodes with groups of 
odd nodes as hubs. In total, the number of measurements is 
2nM<f d/2 /d + 2n/d, which is 0(2kn\og{d/2)/d). When d 
equals to n, the result coincides with Theorem Q] Since n/d 
measurements are needed to measure all the nodes at least 
once, we have 

Theorem 10. The number of measurements needed to recover 
k-sparse vectors associated with with each measurement 
containing at most d nodes is lower bounded by n/d, and 
upper bounded by 0(2knlog(d/2)/d). 

The ratio of the number of measurements by our construc- 
tion to the minimum number needed with length constraint is 
within Cfelog(d/2) for some constant C. 

B. Measurements passing at least one node in a fixed subset 

Recall that for a connected subgraph G5, agent u in S 
aggregates the sum of all the nodes in S, and sends the mea- 
surement to the central operator. One may want to reduce the 
number of agents in a network so as to reduce the monitoring 
and transmission costs. If only a small set Y of nodes can 
perform as the agents, every constructed measurement is then 
required to pass at least one node in Y. 

This can be achieved with small modification to the con- 
struction algorithm. Let Y denote the set of agents. Let D 
denote the set of nodes that can be measured together via a 
hub H. If H n Y is not empty, every measurement contains 
at least one node in Y automatically as all the nodes in H 
are included. Now consider the case that H n Y is empty. If 
there exists a path P from some node j in H to some node 
/ in Y such that P n D is empty, then let H := H U P be 
the new hub, and design measurements for D using hub H. 
Then every measurement contains all nodes in H and thus 
the agent /. If such a path does not exist, pick any node i in 
D and any node / in Y, find the shortest path P' between 
i and /. Let H' := H U P' be the hub, and let D' := D\i 
be the set of nodes that can be measured via H'. Then every 
measurement contains agent /. Node i can be measured by two 
additional measurements, one measures path P', and the other 
measures P'\i, Therefore, with this simple modification, we 
can measure the same set of nodes with each measurement 
contains at least one node in Y, and the total number of 
measurements increases by at most two. 

We summarize the above modification in subroutine Agent. 
For measurement design on general graphs, we first replace 
step 4 in Algorithm 1 in Section IIV-BI with subroutine 
Agent(V^\S f , S, Y, G). Then in each iteration the number of 
measurements is increased by at most two. We then replace 
step 9 with measuring the paths P* and P*\nu st , where ni as t 
is the last node in G, and P* connects rti ast to any node 
j in Y on the original graph. Therefore, the total number 
of measurements needed by the modified algorithm is upper 
bounded by Rf(k, n) + 3R + 2, and each measurement in the 
modified version contains at least one node in Y. 

VII. Sensitivity to hub measurement errors 

In constructions based on the use of a hub, in order to 
measure nodes in S using hub H, we first measure the sum 



Subroutine 3 Agent(H, D, Y, G) 



Input: hub H, set D of nodes, set Y of agents, G 

1 if H n Y ^ $ then 

2 Design /(fc, \D\) + 1 measurements to recover fc-sparse 
vectors associated with D using H as a hub. 

3 else 

4 Find the shortest path between every node in H and 
every node in Y. 

5 if there exists a shortest path P s.t. PnD = $ then 

6 Design /(fc, |D|) + 1 measurements to recover nodes 
in D using H = H U P as a hub. 

7 else 

8 pick a node i in D and a node / in Y, find the 
shortest path P' between i and /. 

9 £>' := W := HU P', design /(fc, \D'\) + 1 
measurements to recover D' with iJ' as a hub. 

10 Measure P' and P'\i to recover node i. 
n end if 
12 end if 



of nodes in H, and then delete it from other measurements to 
obtain the sum of some subset of nodes in S. This arises the 
issue that if the sum of H is not measured correctly, this single 
error would be introduced into all the measurements. Here we 
prove that successful recovery is still achievable when a hub 
measurement is erroneous. 

Mathematically, let xg denote the sparse vector associated 
with S, and let xjj denote the vector associated with H and let 
j^mx\s\ ^ e a measurement matrix that can identify fc-sparse 
vectors associated with a complete graph of \S\ nodes. We 



arrange the vector x such that x 

A w mx<[H<[ 



F 



L \H\ 



is the measurement matrix for detecting fc non-zeros in S using 
hub H, where W is a matrix with all Ts, is a column vector 
of all '0's, and 1 is a column vector of all Ts. Let vector z 
denote the first in measurements, and let zq denote the last 
measurement of the hub H. Then 

I- l T x#l m 



z 




z ° _ 





Ax s 



or equivalently 



z - z Q l m = Ax s . 



(10) 



If there is some error eo in the last measurement, i.e., instead 
of zq, the actual measurement we obtain is 



20 



eo, 



eo hurts the recovery accuracy of xg through ( fTOb . 

To eliminate the impact of eo, we model it as an entry of 
an augmented sparse signal to recover. Let x' = [x T eo] T , 
and A' = [A - 1, 



,], we have 
A'y:' = z — zq1 t 



(11) 



Then, recovering xs in the presence of hub error eo is 
equivalent to recovering fc + 1-sparse vector x' from ( fTTT i. 



We consider one special construction of matrix A mx \ s \ for 
a complete graph. A has ' 1 ' on every entry in the last row, and 
takes value '1' and '0' with equal probability independently 
for every other entry. A' = [A — l m ], let A be the submatrix 
of the first m — 1 rows of A'. Let y = z — Zol m , and let y 
denote the first m — 1 entries of y. We have, 

(2i-^ m ^ 1 ) x l s l)x' = 2y~y m . 

We recover x' by solving the i\ -minimization problem, 

min ||x||i, s.L (2A-W^ m - 1 '> x ^ = 2y-y m . (12) 

Theorem 11. With the above construction of A, when m > 
C(fc + l)log|5| for some constant C > and \S\ is large 
enough, with probability at least 1 — OdS*! - ") for some 
constant a > 0, x' is the unique solution to M2\) for all fc + 1- 
sparse vectors x' in TZ' S ' +1 . 

Theorem[TT]indicates that even though the hub measurement 
is erroneous, one can still identify fc-sparse vectors associated 
with S with 0((k + 1) log |5|) measurements. 

The proof of Theorem [TT] relies heavily on Lemma 

Lemma 2. If matrix $p x ™ takes value —l/^/p on every 
entry in the last column and takes value ±l/^/p with equal 
probability independently on every other entry, then for any 
5 > 0, there exists some constant C such that when p > 
C(fc + 1) logn and n is large enough, with probability at least 
1 — 0(n~ a ) for some constant a > it holds that for every 
U C {1, n} with \U\ < 2fc + 2 and for every x € K 2k + 2 , 

(l-<*)||x||! < ll^xlll < (1 + ^)11x111, d3) 

where $y M the submatrix of $ with column indices in U. 

Proof: Consider matrix Q'P xn with each entry taking 
value ±l/y/p with equal probability independently. For every 
realization of matrix <J?', construct a matrix $ as follows. For 
every ie{l,...,p} such that & ln =l/y/p, let $j, = for 
all j = 1, n. Let $y = ■ for every other entry. One can 
check that $ and $ follow the same probability distribution. 
Besides, according to the construction of $, for any subset 
UC{l,...,n}, 

$' t/ T $' t/ = ®l$u. (14) 

The Restricted Isometry Property [10| indicates that the 
statement in Lemma [2] holds for From ( TBl i. and the fact 
that H^xlll = x T <&' c/ T $' c/ x, the statement also holds for 4. 
Since $ and $ follow the same probability distribution, the 
lemma follows. ■ 
Proof: (of Theorem [TTT i From Lemma [2] when m > 
C(k + l)log|5| for some C > and \S\ is large enough, 
with probability at least 1 - 0(\S\~ a ), matrix (2 A - 
W( m-1 ) x l s 'l) j / \frn — 1 satisfies ( fT3l ) for some small enough 
S, say S < \[2— 1. Then from ifTTIl . 11261 . dTzb can recover all 
fc + 1-sparse vectors correctly. ■ 

VIII. Simulation 

Experiment 1 (Effectiveness of Algorithm 1): Given a graph 
G, we apply Algorithm 1 to divide the nodes into groups 
such that each group (except the last one) can be measured 




via some hub. The last group contains one node and can be 
measured directly. It is know that M± n — [~log(n + 1)], 
and the corresponding measurement matrix has the binary 
expansion of integer i as column i ED . Also from ([TJ 
the number of measurements required to recovery fc-sparse 
vectors is within a constant times kM^ n . Therefore, here we 
design measurements to recover 1-sparse vectors on G as an 
example. The total number of constructed measurements is 
^' -1 [log(ni + 1)] + q, where n,i is the number of nodes in 
group i and q is the total number of groups. 

In Fig. 6, we gradually increase the number of links in 
a graph with n = 1000 nodes. We start with a uniformly 
generated random tree, and in each step randomly add 25 links 
to the graph. All the results are averaged over one hundred re- 
alizations. The number of measurements constructed decreases 
from 73 to 30 when the number of links increases from n — 1 
to 2n — 1. Note that the number of measurements is already 
within 3A/f n when the average node degree is close to 4. The 
radius R of the graph decreases from 13 to 7, and we also plot 
the upper bound R [log n\ +R+ 1 provided by Theorem|7] One 
can see that the number of measurements actually constructed 
is much less than the upper bound. 

In Fig. 7, we consider the scale-free network with Barabasi- 
Albert (BA) model J4) where the graph initially has mo 
connected nodes, and each new node connects to m existing 
nodes with a probability that is proportional to the degree of 
the existing nodes. We start with a random tree of 10 nodes 
and increase the total number of nodes from 64 to 1024. Every 
result is averaged over one hundred realizations. Since the 
diameter of BA model is 0(logn/loglogn)) 0, then by 
Theorem [7] the number of our constructed measurements is 
upper bounded by 0(log 2 nj log log n)). As the mixing time 
of BA model is O(logn) [301, methods in (B) and (38) 
require 0(log 3 n) random measurements. 

Experiment 2 (Recovery Performance with Hub Error): 

We generate a graph with n — 500 nodes from BA model. 
Algorithm 1 divides nodes into four groups with 375, 122, 2 
and 1 node respectively. For each of the first two groups with 
size m (i — 1,2), we generate [n.j/2] random measurements 
each measuring a random subset of the group together with its 
hub. Every node of the group is included in the random subset 
independently with probability 0.5. We also measure the two 
hubs directly. Each of the three nodes in the next two groups is 



measured directly by one measurement. The generated matrix 
A is 254 by 500. We generate a sparse vector xo with i.i.d. 
zero-mean Gaussian entries on a randomly chosen support, 
and normalize 1 1 xo 1 1 2 to 1. 

To recover Xo from y = Axq, one can run the widely 
used l\ -minimization ifTTl to recover the subvectors associated 
with the first two groups, and the last three entries of Xo 
can be obtained from measurements directly. However, as 
discussed in Section IVHI an error in a hub measurement 
degrades the recovery accuracy of subvectors associated with 
that group. To address this issue, we use a modified £%- 
minimization in which the errors in the two hubs are treated 
as entries of an augmented vector to recover. Specifically, let 
the augmented vector z = [x r ,ei,e2] T and the augmented 
matrix A = [A (3 7], where e\ (or e-i) denotes the error in the 
measurement of the first (second) hub, and the column vector 
(3 (or 7) has ' 1' in the row corresponding to the measurement 
of the first (or second) hub and '0' elsewhere. We then recover 
z (and thus xo) from y = Az by running l\ -minimization on 
each group separately. 

Fig. 8 compares the recovery performance of our modified 
l\ -minimization and the conventional l\ -minimization, where 
the hub errors e\ and e-i are drawn from standard Gaussian 
distribution with zero mean and unit variance. For every 
support size k, we randomly generate two hundred fc-sparse 
vectors Xq, and let x, denote the recovered vector. Even with 
the hub errors, the average ||x r — X0II2/HX0H2 is within 10 -6 
when xo is at most 35-sparse by our method, while by iy- 
minimization, the value is at least 0.35. We also consider 
the case that besides errors in hub measurements, every other 
measurement has i.i.d. Gaussian noise with zero mean and 
variance 0.04 2 . The average ||x r — 5C 1| 2 / II 1 1 2 here is smaller 
with our method than that with l\ -minimization. 

IX. Conclusion 

This paper addresses the sparse recovery problem with 
graph constraints. We provide explicit measurement construc- 
tions for special graphs, and propose measurement design 
algorithms for general graphs. Our construction for a line 
network is optimal in terms of the number of measurements 
needed. The constructions on other graphs also improve over 
the existing results. We characterize the relationship between 
the number of measurements for sparse recovery and the 



graph topology. We also derive upper and lower bounds of the 
minimum number of measurements needed for sparse recovery 
on a given graph. It would be interesting to tighten such 
bounds, especially the lower bounds. 

We have not considered the effect of the measurement noise. 
Also, we assume full knowledge of the fixed network topology, 
and measurement construction when the topology is time- 
varying or partially known is an open question. 

Appendix 

A. Proof of Theorem \3\ 

Let A mxn denote the matrix with m realizations of the n- 
step Markov chain. To prove the statement, from 1101 . we only 
need to show that the probability that every 2k columns of A 
are linearly independent goes to 1 as n goes to infinity. 

Let Ai be a submatrix of A with columns in /, where / is 
an index set with |/| = 2k. Let A Sj i (1 < j < |_f|J) be a 
submatrix of Aj formed by row 2k(j — 1) + 1 to row 2k j of 
A]. Given /, the probability that rank(As ./)< 2k is the same 
for every given j, and let it denoted by ir^. Let Pj denote the 
probability that rank(A/)< 2k, then 

^1<(^) LSJ - d5) 

To characterize it^, consider matrix B 2kx2k with Bu = 
for i — 2, 3, ...,2k and By = 1 for all the other elements. 
Since rank(i?)= 2k, then 

7r^ < 1 — P(As J i is a row permutation of B). (16) 

One can check that in this Markov chain, for every 1 < i < 
k < n, P(X k = 1 | Xi = 1) > 1/2, P{X k = | Xi = 1) > 
1/4, P(X k = 1 | Xi = 0) > 1/2, and P(X k = 1) > 1/2. 
Since B has (2k)\ different row permutations, 

P{A SjI is a row permutation of B) > (2fc)!/2 4fe2+2fe ^ 1 . 

(17) 

Combining {T51 . ( [To*] ) and ST% , we have 

P (every 2k columns of A are linearly independent) 
= 1 - P(rank(A/) < 2k for some / with |/| = 2k) 

>i- ( n \pi>i- f"V (2fc)!( 5 )4fc2+2fc " LsJ , 

\2k) d ~ \2k) 

where the first inequality follows from the union bound. Then 

if to = g{k)logn = {2k + l)2 ik2+2k ' 1 \ogn/(2k - 1)1, 
the probability that every 2k columns of A are linearly 
independent is at least 1 — l/((2fe)!n). 

B. Proof of Proposition Q] 

We view nodes 2i — 1 and 2i as a group for every i 
(1 < i < L§J)> denoted by Bi. Consider the special case 
that for some t, both nodes in B t are 'l's, and all other nodes 
are '0's. Then every measurement that passes either node or 
both nodes in B t is always '1'. Consider a reduced graph 
with Bi, V« as nodes, and link (Bi,Bj) (i ^ j) exists only 
if in Q 4 there is a path from a node in Bi to a node in Bj 
without going though any other node not in Bi or Bj. Bi is 
'1' if both node 2i — 1 and node 2i in Q 4 are 'l's and is 



'0' otherwise. The reduced network is a ring with [f J nodes, 
and thus [n/A\ measurements are required to locate one non- 
zero element in the reduced network. Then only to locate two 
consecutive non-zero elements associated with Q 4 , we need at 
least [n/A\ measurements, and the claim follows. 

C. Proof of Theorem \6\ 

Since checking whether or not r given sets form an re- 
partition takes polynomial time, r-partition problem is NR 

We next show that the NP-complete r-coloring (r > 3) 
problem is polynomial time reducible to r-partition problem. 

Let G = (V, E) and an integer r be an instance of r- 
coloring. For every (u, v) € E, add a node w and two links 
(w, u) and (w, v). Let W denote the set of nodes added. Add a 
link between every pair of nodes in V not already joined by a 
link. Let H denote the augmented graph and let V' = V U W. 
We claim that if there exists an r-partition of H, then we can 
obtain an r-coloring of G, and vice versa. 

Let Si (i = 1, ...,r) be an r-partition of H. Suppose there 
exists link (u, v) 6 E s.t. u and v both belong to Si for some 
i. Let w denote the node in W that only directly connects to 
u and v. If w € Si, then w has both neighbors in the same set 
with w, contradicting the definition of r-partition. If w ^ Si, 
then Hyi\$. is disconnected since w does not connect to any 
node in V'\Si. It also contradicts the definition of r-partition. 
Thus, for every (u, v) E E, node u and v belong to two sets 
Si and Sj with i ^ j. Then we obtain an r-coloring of G. 

Let Ci C V (i = 1, ...,r) denote an r-coloring of G. We 
claim that Ni = C» (i = 1, r- 1), and N r = C r U W form 
an r-partition of H. First note for every u e V, at least one of 
its neighbors is not in the same set as u. For every w £ W, w 
is directly connected to u and v for some (u, v) £ E, and u 
and v are in different sets Ci and Cj for some i ^ j. Therefore, 
w has at least one neighbor that is not in N r . Second, we will 
show Hy\Ni is connected for all i. Hy'\N r is a complete 
subgraph, and thus connected. For every i < r, let Si := 
V\C U then V'\Ni = Si U W. H Sl is a complete subgraph, 
and thus connected. For every w £ W, since its two neighbors 
cannot be both in Ci, then at least one neighbor belongs to Si, 
thus H V '\ff r = Hs^w is connected. Ni (i — l,...,r) thus 
forms an r-partition. 
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